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Abstract. First, we extend Leiler-Milner RPO theory, by giving general conditions to 
obtain IPO labeled transition systems (and bisimilarities) with a reduced set of transitions, 
and possibly finitely branching. Moreover, we study the weak variant of Leifer-Milner 
theory, by giving general conditions under which the weak bisimilarity is a congruence. 
Then, we apply such extended RPO technique to the lambda-calculus, endowed with lazy 
and call by value reduction strategies. We show that, contrary to process calculi, one 
can deal directly with the lambda-calculus syntax and apply Leifer-Milner technique to 
a category of contexts, provided that we work in the framework of weak bisimilarities. 
However, even in the case of the transition system with minimal contexts, the resulting 
bisimilarity is infinitely branching, due to the fact that, in standard context categories, 
parametric rules such as the beta-rule can be represented only by infinitely many ground 
rules. To overcome this problem, we introduce the general notion of second-order context 
category. We show that, by carrying out the RPO construction in this setting, the lazy 
observational equivalence can be captured as a weak bisimilarity equivalence on a finitely 
branching transition system. This result is achieved by considering an encoding of lambda- 
calculus in Combinatory Logic. 



Recently, much attention has been devoted to derive labeled transition systems and 
bisimilarity congruences from reactive systems, in the context of process languages and 
graph rewriting, |Sew02[ ILMOOi ISSM IGM051 IBGK06L IBKM061 IEK06j . In the theory of 
process algebras, the operational semantics of CCS was originally given via a labeled transi- 
tion system (Its), while more recent process calculi have been presented via reactive systems 
plus structural rules. Reactive systems naturally induce behavioral equivalences which are 
congruences w.r.t. contexts, while Its's naturally induce bisimilarity equivalences with coin- 
ductive characterizations. However, such equivalences are not congruences in general, or 
else it is an heavy, ad-hoc task to prove that they are congruences. 
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Generalizing |Sew02| , Leifer and Milner |LMOO| presented a general categorical method 
for deriving a transition system from a reactive system, in such a way that the induced 
bisimilarity is a congruence. The labels in Leifer-Milner's transition system are those con- 
texts which are minimal for a given reaction to fire. Minimal contexts are identified via 
the categorical notion of relative pushout (RPO). Leifer-Milner's central result guarantees 
that, under a suitable categorical condition, the induced bisimilarity is a congruence w.r.t. 
all contexts. 

In the literature, some case studies have been carried out, especially in the setting of 
process calculi, for testing the expressivity of Leifer-Milner's approach. Some difficulties 
have arisen in applying the approach directly to such languages, viewed as Lawvere theo- 
ries, because of structural rules. To overcome this problem, two different approaches have 
been considered. The first approach consists in using more complex categorical construc- 
tions, where structural rules are accounted for explicitly, [LeiOH ISS05] . In the second 
approach, intermediate encodings have been considered in graph theory, for which the ap- 
proach of "borrowed contexts" has been developed |EK06j . and in Milner 's bigraph theory. 
Here structural rules are avoided, since structurally equivalent terms are equated in the 
target language. 

Moreover, the following further issues have arisen in applying Leifer-Milner's technique. 

(i) Leifer-Milner's bisimilarity is still redundant, and many labels have to be eliminated 
a posteriori, by an ad-hoc reasoning. Thus general results are called for, in order to 
reduce the complexity of the bisimilarity a priori. 

(ii) In some cases it is useful to consider weak variants of Leifer- Milner technique. However, 
for the weak bisimilarity we only have a partial congruence result, stating that such 
bisimilarity is a congruence w.r.t. a certain class of contexts. However, in many 
concrete cases, the weak bisimilarity turn out to be a full congruence. Thus it will be 
useful to study general conditions under which this happens. 

(iii) When Leifer-Milner technique is applied in the standard setting of term and context 
categories (Lawvere theories), the rules in the rewriting system cannot be represented 
parametrically, but only at a ground level through a (infinite) series of possible instan- 
tiations. As a consequence, the bisimilarity turns out to be infinitely branching. In 
[KSS05] , a generalization of Leifer-Milner technique for dealing with parametric rules 
has been introduced. This approach is rather complex and not completely satisfac- 
tory. An alternative approach (which is considered in the present paper) consists in 
studying second-order versions of term and context categories, which allow paramet- 
ric representations of rewriting rules, and carrying out Leifer-Milner technique in this 
setting. 

In this paper, we address all the above issues. In particular, in the first part of the paper, 
we extend Leifer-Milner theory, by providing general results for reducing the complexity 
of the bisimilarity, and by studying conditions under which the weak bisimilarity is a full 
congruence. Then, we focus on the prototypical example of reactive system given by the 
A-calculus, endowed with lazy and call by value (cbv) reduction strategies. We show that, 
in principle, contrary to most of the case studies considered in the literature, one could 
deal directly with the A-calculus syntax and apply Leifer-Milner technique to the category 
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of term contexts induced by the A-terms, provided that we work in the setting of weak 
bisimilarities. Applying our general results, we get quite economical weak bisimilarities 
which are congruences and we recover exactly both lazy and cbv contextual equivalences. 
As a by-product, we also get an alternative proof of the Context Lemma for the lazy case. 
However, the bisimilarities that we obtain are still infinitely branching. This is mainly due 
to the fact that, in the category of contexts, the /3-rule cannot be described parametrically, 
but it needs to be described extensionally using an infinite set of pairs of ground terms. In 
order to overcome this problem, we consider the combinatory logic and we introduce the 
general notion of category of second-order term contexts, which provide a solution to the 
third issue above. Our main result amounts to the fact that, by carrying out Leifer-Milner's 
construction in this setting, the lazy contextual equivalence can be captured as a weak 
bisimilarity equivalence on a (finitely branching) transition system, while for the cbv case, 
the finitely branching transition system induces a bisimilarity which is strictly included 
in the contextual equivalence. Technically, these results are achieved by considering an 
encoding of the lazy (cbv) A-calculus in KS Combinatory Logic (CL), endowed with a lazy 
(cbv) reduction strategy, and by showing that the lazy (cbv) contextual equivalence on 
A-calculus can be recovered as a lazy (cbv) equivalence on CL. It is necessary to consider 
such encoding, since the approach of second-order context categories proposed in this paper 
works for reaction rules which are "local", that is, the reaction does not act on the whole 
term, but only locally. But the substitution operation on A-calculus is not local. 

Finally, the correspondence results obtained in this paper about the observational equiv- 
alences on A-calculus and CL are interesting per se and, although natural and ultimately 
elementary, had not appeared previously in the literature. 

Summary. In Section [21 we summarize the theory of reactive systems of |LM00j . In Sec- 
tion [3l we extend such theory with new general results about weak bisimilarity, and about 
the "pruning" of Leifer-Milner Its and the induced bisimilarity. In Section HI we present the 
A-calculus together with lazy and cbv reduction strategies and observational equivalences, 
and we discuss the RPO approach applied to the A-calculus endowed with a structure of 
context category. In Section [5l we focus on Combinatory Logic (CL), we show how to re- 
cover on CL the lazy and cbv strategies and observational equivalences, and we discuss the 
RPO approach applied to CL, viewed as a context category. In Section [6l we introduce the 
notion of second-order context category, and we apply the RPO approach to CL viewed as 
a second-order rewriting system, thus obtaining a characterization of the lazy observational 
equivalence as a weak bisimilarity on a finitely branching Its. Final remarks and directions 
for future work appear in Section [71 

The present paper extends [DHLOSj . The main new contribution of the present paper 
is the extension of Leifer-Milner theory, which appears in Section [31 This allows to deal 
with the A-calculus in the subsequent sections in a smoother way, to get stronger results 
about the Its and the induced bisimilarity, both for the lazy and for the cbv case, and also 
to provide an alternative proof of the Context Lemma in the lazy case. 
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2. The Theory of Reactive Systems 

In this section, we summarize the theory of reactive systems proposed in [LMOO] to 
derive Its's and bisimulation congruences from a given reduction semantics. Moreover, we 
discuss weak variants of Leifer-Milner's bisimilarity equivalence. 

The theory of [LMOO] is based on a categorical formulation of the notion of reactive 
system, whereby contexts are modeled as arrows of a category, terms are arrows having as 
domain (a special object which denotes no holes), and reaction rules are pairs of terms. 

Definition 2.1 (Reactive System). A reactive system C consists of: 

• a category C; 

• a distinguished object € \C\; 

• a composition-reflecting subcategory V of reactive contexts; 

• a set of pairs R C (J/g|c| ^[^^ ^] ^ ^I^' -^1 reaction rules. 

The reactive contexts are those in which a reaction can occur. By composition-reflecting 
we mean that dd' € D implies d,d' G T>. 

Reactive systems on term languages can be viewed as a special case of reactive systems 
in the sense of Leifer-Milner by instantiating C as a suitable category of term and contexts, 
also called the (free) Lawvere category, |LM00j . In this view, we often call terms the arrows 
with domains 0, and contexts the other arrows. 

From the set of reaction rules one generates the reaction relation by closing them under 
all reactive contexts: 

Definition 2.2 (Reaction Relation). Given a reaction system with reactive contexts V and 
reaction rules R, the reaction relation — > is defined by: 

t ^ u iff t = dl, u = dr for some d and r) G R . 

The behavior of a reactive system is expressed as an unlabeled transition system. On 
the other hand, many useful behavioral equivalences are only defined for Its's. The passage 
from reactive systems to Its's is obtained as follows. 

Definition 2.3 (Context Labeled Transition System). Given a reactive system C, the 
associated context Its is defined as follows: 

• states: arrows t : ^ / in C, for any /; 

• transitions: t — ^-c u iff c S C and ct ^ u (i.e., ct and u are in the reaction relation). 

In the case of a reactive system defined on a category of contexts, a state is a term t, and 
an associated label is a context c such that ct reduces. In the following, we will consider 
also Its's obtained by reducing the set of transitions of the context Its. In the sequel, we 
will use the word Its to refer to any such Its obtained from a context Its. 
Any Its induces a bisimilarity relation as follows: 

Definition 2.4 (Bisimilarity). Let — ^ be a Its. 



RPO, SECOND-ORDER CONTEXTS, AND A-CALCULUS * 



5 






(^) (^^) (m) (iv) 

Figure 1: Redex Square and Relative Pushout. 

(i) A symmetric relation TZ C [Jj^^^ C(0, /) xC(0, /) on the states of the Its is a bisimulation 
if: 

{a,b) £n A a^a' =^ 3b'. b ^ b' A {a', b') e TZ . 

(ii) We call bisimilarity the largest bisimulation. 

(iii) The bisimilarity on the context Its is called context bisimilarity ~c'. 

It is easy to check that the context bisimilarity is a congruence w.r.t. all contexts, i.e., 
if a ~c b, then for any context c, ca cb. However, intuitively only those contexts 
which contain the minimal amount of information for a reaction to fire are relevant, while 
the others are redundant. Moreover, often context bisimilarity gives an equivalence which 
is too coarse, as we will see also in this paper. Thus, in [LMOO] . the authors proposed a 
categorical criterion for identifying the "smallest context allowing a reaction" . They defined 
relative pushouts (RPOs), of which idem relative pushouts (IPOs) are a special case. One 
can define a Its using IPOs. Leifer-Milner's central result consists in showing that, under 
a suitable categorical condition, such Its is well-behaved, in the sense that the induced 
bisimilarity is a congruence. 

Definition 2.5 (RPO/IPO). 

(i) Let C be a category and let us consider the commutative diagram in Fig. H^i). Any 
tuple {l5,e,f,g) which makes diagram in Fig. mii) commute is called a candidate 
for (i). A relative pushout (RPO) is the smallest such candidate, i.e., it satisfies the 
universal property that given any other candidate {lQ,e' , f , g'), there exists a unique 
mediating morphism h : ^ such that both diagrams in Fig. [Hiii) and Fig. HKiv) 
commute. 

(ii) A commutative square such as diagram (i) in Fig [1] is an idem pushout (IPO) if 
(/4, c, d, idj^) is its RPO. 

Definition 2.6 (IPO Transition System). 

(1) States: arrows t : ^ / in C, for any /; 

(2) Transitions: t — ^/ dr iff d & V, ct = dl, {l,r) G R and the diagram in Fig. is an 
IPO. 
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Figure 2: IPO pasting. 



Let ~/ denote the bisimilarity induced by the IPO Its. 

Definition 2.7 (Redex Square). Let C be a reactive system and f : — > I2 an arrow in 
C. A redex square (see Fig. Hl^i)) consists of a left-hand side / : ^ Is of a reaction rule 
(Z : — 5- Is, r : — > I3) G R, a context c : I2 ^ h and a reactive context d : I3 —>■ such 
that ct = dl. 

A reactive system C is said to have redex RPOs if every redex square has an RPO. 
The following is a fundamental lemma stating a property of IPO squares. 



Lemma 2.8 (IPO pasting, |LM00j ). Suppose that the square in Fig. \^i) has an RPO and 

that both squares in Fig. \^ii) commute. 

(i) If the two squares of Fig. \^ii) are IPOs so is the outer rectangle. 

(ii) It the outer rectangle and the left square of Fig. \^ii) are IPOs so is the right square. 

From the above lemma Leifer and Milner derived their central result: 



Theorem 2.9 ( |LMOO] ) . Let C he a reactive system having redex RPOs. Then the IPO 
bisimilarity ~/ is a congruence w.r.t. all contexts, i.e., if a b then for all c of the 
appropriate type, ca ~/ cb. 



2.1. Weak Bisimilarity. For dealing with the A-calculus, it will be useful to consider the 
weak versions of the context and IPO Its's defined above, together with the corresponding 
notions of weak bisimilarities. 

One can proceed in general, by defining a weak Its from a given Its: 



Definition 2.10 (Weak Its and Bisimilarity). Let 
tifying an unobservable action). 

(i) We define the weak Its =^ by 



be a Its, and let r be a label (iden- 



t 



u iff 



5- U 
. * 4./ 



t' 



u 



u 



if a = r 
otherwise 



where — >* denotes the reflexive and transitive closure of — >. 
(ii) Let us call weak bisimilarity the bisimilarity induced by the weak Its. 
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The above definition differs from the one proposed in [LMOOj . where, in case a 7^ r, ==^ is 
defined by o — We cannot use the latter, since it discriminates A-terms which are 
equivalent in the usual semantics. 

The following easy lemma gives a useful characterization of the weak bisimilarity, 
whereby any -^-transition is mimicked by a =^-transition: 

Lemma 2.11. Let be a Its and let =^ be the corresponding weak Its. The induced weak 
bisimilarity is the greatest symmetric relation TZ s.t.: 

{a,b) en A a^a' =^ 3b'. b b' A (a', b') £ TZ . 

The following lemma provides a coinduction "up-to" principle, which will be useful in 
the sequel: 

Lemma 2.12. Let be a Its and let =^ be the corresponding weak Its. The induced weak 
bisimilarity is the greatest symmetric relation TZ s.t.: 

{a,b) en A a d^'a' =^ 3b'. b b' A {a',b') G n* , 

where denotes — ^* o (f is possibly t), andn* denotes the reflexive and transitive 
closure ofn. 

Proof. Let us call "bisimulation up-to" a relation n as in the statement of the lemma. In 
order to prove the claim, it is sufficient to prove that, if 7^ is a bisimulation up-to, then 
n* is a bisimulation. Let 7^ be a bisimulation up-to. First, one can easily check that 
{an*b A a ^ a') =^ 3b'. {b ^ b' A a'n*b') (by induction on the length of the 
chain a n . . .n b). Now, lei a = n ai . . . On-i n On = b and a =^ a' . We prove that 
3b'. {b^b' A a'n*b'), by induction on n > 0. If n = 0, the claim is immediate. If n > 
and ad^' a" a', then, since n is a bisimulation up-to, ai =^ a'( A a"n*a'(, and, by 
what we have proved before, 3a'^. {a'{ =^ a'-^ A a'n*a'^). Finally, by induction hypothesis, 
3b'. {bd^b' A a'{n*b'). Hence a'7^*6'. □ 

For dealing with the A-calculus, we will consider a notion of weak IPO bisimilarity, 
where the identity context is unobservable. Such notions of weak IPO bisimilarities are 
not congruences w.r.t. all contexts, in general, however, as observed in [LMOOj (end of 
Section 5), they are congruences at least w.r.t. reactive contexts: 

Theorem 2.13. Let C be a reactive system having redex RPOs. Then the weak IPO 
bisimilarity ^i, where the identity context is unobservable, is a congruence w.r.t. reactive 
contexts. 

3. Extending the Theory of Reactive Systems 

In this section, we present some original results concerning the Its obtained by the RPO 
construction. These results concern two issues: 

Weak-bisimilarity: Since in the A-calculus the weak bisimilarity is the equivalence to be 
used, we present some general conditions assuring that the weak bisimilarity, on the Its 
obtained by an IPO construction, is a congruence w.r.t. all contexts. 
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Pruning the Its tree: In order to obtain a feasible Its, i.e., a Its with a reduced set of 
transitions, possibly finitely branching, it is often necessary to prune the Its obtained 
by an IPO construction. We present some general conditions allowing to prune IPO Its, 
without modifying the induced (weak)-bisimilarity. 

We present our results in two different versions, the first one is quite simple, but it does not 
apply to our particular case, so we present a second version that is more involved but suits 
our needs. We choose to present the simple first version of our results as an introduction to 
the second one, and also because it can have applications in modeling languages different 
from the A-calculus. 

Some preliminary definitions are necessary. 

Definition 3.1. Given a Its obtained by the IPO construction: 

• Given a set of labels L, the L-restricted IPO Its is the Its obtained by removing from 
the IPO Its all transitions not labeled by elements in L. We denote by «l the weak 
bisimilarity induced by the L-restricted IPO Its. 

• We denote by R the set of labels that are reactive contexts. We denote by the weak 
bisimilarity induced by the i?-restricted IPO Its. 

• In a reactive system, we say that the family of IPO transitions with label / : Iq ^ /i is 
definable by contexts if there exists a list of contexts ei, . . . ,eh ■ Iq ^ h such that, for all 
t : ^ Iq, we have that: Vi. t eit and t t' =^ 3i. t' = Cit. 

Intuitively, a family of IPO transitions with label / : /q — > /i is definable by contexts if / is 
an IPO for any arrow t : ^ Iq and the IPO transitions on / can be described by contexts, 
that is, they do not modify the internal structure of the term t. 

Proposition 3.2. Let C be a reactive system having redex RPOs. If any IPO context is 
either reactive or definable by contexts (or both), then the weak IPO bisimilarity ~7 (with 
the identity IPO context unobservable) is a congruence. Moreover ~7 coincides with 

Proof. Consider the relation S = {{ct,cu) \ t n, c context}. It is immediate that 
and from this, ~/C {{ct,cu) \ t u, c context} C S. If we prove also the 
inclusion S Q^j, then all relations are equal and ~/ coincides with its contextual closure, 
i.e., it is congruence. By Lemma [2. Ill in order to prove S — ~y it is sufficient to show that, 
for any {ct, cu) G S, if ct — t' then there exists u' s.t. cu u' with t'Su' . 
Consider the following diagram: 




d' 



where the outermost rectangle is the IPO inducing the transition ct — t', namely t' = d'dr 



with r) a reaction rule, while the left square is a RPO of the redex square. By Lemma [2^ 
the IPO pasting property, we have that also the right-hand square of the diagram is an IPO 
There are two cases to consider: 
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(i) If the context /' is definable by contexts, since — >j dr, there exists a context e such 
that dr = et and t' = d'et, it follows that u —^j eu. That is, there exist a reaction 
rule (/i,ri) and a reactive context di s.t. eu = diri, and the left-hand square of the 
following diagram is a IPO. 




d' 

Since the right-hand square is IPO, by the IPO pasting property, Lemma [2.8( also the 
outermost rectangle is an IPO. It follows that cu — d'diVi = d'eu, which implies 
the claim. 

(ii) If the context /' is reactive, then it so also the context d' f (composition of reactive 
contexts) and the context c (reactive contexts are composition-reflecting). Moreover, 
by the definition of bisimilarity, there exists uq such that u uq (which means 

u —^i* ui — ^/ U2 — uq) with Mo dr. Since c is reactive and squares of the 




are IPOs, by composition of IPO squares (and by induction) it is easy to prove that 
cu —^i* cui — ^/ d'u2 — d'uo, which implies the claim. □ 

For dealing with the A-calculus, we present a second proposition that is similar in spirit 
to Proposition [321 although it is not a direct generalization. The second proposition consid- 
ers both the category of unary linear term contexts and a category of "multi-holed" linear 
term contexts. The category of unary contexts is the most suitable for the IPO construc- 
tion, while the category of multi-holed contexts is useful to represent some transitions (in 
the Its) through insertions of terms in suitable contexts. 

The following definition formalizes the relation existing between the two categories of 
contexts. 

Definition 3.3. A category D is a list extension of a category C if the following hold: 

• C contains a distinguished object 0. 

• The objects of T> are finite lists of objects of C different from 0. 

• By identifying with the empty list ( ), and any other object I in C with the singleton 
list (/), C is a full subcategory of T>. 

• There exists a concatenation functor ® from PxP to T) acting as concatenation on objects 
(Iq, • • • , In) ® {Jqi • • • 1 Jm) = (-^0) ■ ■ ■ , In, Jq, ■ ■ ■ , Jm) and being associative on arrows. 

In the spirit of the previous remark we will call unary (single-holed) contexts the arrows in 
C (with domain different from 0), and multi-holed contexts the arrows in T>. 
Two other definitions are necessary. 
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Definition 3.4. Given a reactive system C on a category C, and a category V, list extension 
of C: 

(i) we define a multi-holed context g : {Iq, . . . ,In) — > I IPO uniform if for any context 
f : I ^ J appearing as label in the IPO Its, there exists a list of multi-holed contexts 
gi : {hfi, . . . , h^m) J,...,gh ■ {hfi, ■ ■ ■ , Ih,nJ J, and a list of functions h ■ 
{0, . . . , ni} ^ {0, . . . , n}, . . . ,lh : {0, . . . , n^} {0, . . . , n} such that, for any n-tuple 
of C terms Iq : ^ Iq, . . . ,tn ■ ^ In, '^^ have that: 

- Vi g{to ^ ...(g)tn) — ^/ gi{tk{Q) . . . (8) ti^(n,)) and 

- g{to «) ... (8) in) — >i t' =^ 3i. t' = gi{t^Q) ^ ■■■^ k,{n,)); 

(ii) a context g : {Iq, . . . , In) ^ I has a reactive index i if for any list of n terms Iq : ^ 
/o, . . . , tj+i, . . . ,tn ■ ^ In, the context g{to ... 81 U-i 8) idj. (g) U+i ... 81 tn) : 
Ii — > /, seen as a context in C, is reactive. 

Intuitively, a context g is IPO uniform if the behavior wrt the IPO reaction of the term 
di'^hio) ^ ■ ■ ■ tii(ni)) does not depend on the terms ^^.(o), ■ ■ ■ ,tii{ni)- We remark that the 
notion of "uniform" is not a generalization of the notion of "definable by contexts" . 

Proposition 3.5. Let C be a reactive system having redex RPOs. 

(i) The weak IPO bisimilarity (with the identity IPO context unobservable) is a con- 
gruence if there exists a category V, list extension of C such that any (multi-holed) 
context g : (Iq, . . . , In) I is either IPO uniform or it has a reactive index (or both). 

(ii) Moreover, if the reaction relation is deterministic, i.e., any term can react in at most 
one possible way, then the relation ~/ coincides with 

Proof. Here we present only the proof of point (ii). The proof of point (i) is almost identical 
and can be derived, from the present proof, by substituting the relation with ~/, and 
by simplifying some steps. 

By repeating the same arguments used at the beginning of the proof of Proposition [321 
it is sufficient to prove that the relation 

S = { {g{to (g) . . . tn),g{uQ ® ...®Un)) \ g : {Io,...,In) I, ^i-ti Ui } 

is contained in the weak bisimilarity. By Lemma 12.121 it is sufficient to show that for any 
{gj,to . . . (g) tn),g{uo . . . (g) Un)) € S and IPO-transition /, if g{tQ (g . . . g) t„) t, with 
— >j the last step of the chain of reactions, then there exists u s.t. g{uo (g . . . (g) Un) u 
with tS*u. The |)roof is by double induction on the number of steps of the transition 
g{tQ (g) . . . (gi tn) t, and on the number n of holes in the list context g. 

The basic case is when g(tQ (g) . . . (g) t„) t in steps (/ = id), in this case there is 
nothing to prove. ^ 

Now suppose g{tQ (g . . . (g t„) t' t, in this case (/' = id A f" = /) or 

(/' = /A/" = zdAt' = t), 

There are two cases to consider: 
(i) The context g is IPO-uniform: in this case there exists a context e : {Iq, . . . , I'^,) Ji 
and a function / :^ {0, . . . n'} — > {0, . . . , n} such that t' = e{ti(fj^ (g . . . (g) *«(„')) and 
g{uQ (g . . . (g Un) -^i e(u;(o) (g • . . (g ui(^n'))- By application of the inductive hypothesis, 
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on a smaller number of transitions steps, there exists u s.t. e(ti;(o) ® ■ ■ . (X) u 
with tS*u, and from which the claim follows, 
(ii) The context g has a reactive index i, for the sake of simplicity, assume i = 0. Consider 
the arrow g' = g{tQ (g) idj^ ... (8) idj^) : (Ii, . . . , I„) I. Since . . . t„) = 
5r(to (g . . . (8) tn) t, by inductive hypothesis, on the number of holes in the multi- 
holed contexts, there exists u such that g'{ui ... Un) = g{to (8 ui CS> • • • (g Un) ==^i u, 
with tS*u. 

Now consider the context g" = g{Idif^ ® ui® . . . ® Un) : lo ^ I- The context g" is 
reactive and g"{to) ==>i u. To obtain the claim, it remains to prove that there exists 
u' s.t. g"iuo) = g{uQ . . . (g) n„) =>i u', with uS*u' . 

More generally we prove that for any reactive context go '■ Jq ^ Ji, any IPO context 
/ : Jl ^ J2, and any^air of terms to,Uo, if to ~_r, Uo and go{to) ==>i t'^ then there 
exists s.t. goiuo) uj, and t'^S*u'^. The proof is by induction on the number of 
steps in the transition g"{to) ==^i t^. The basic case is when the reaction is of zero 
steps; in this case there is nothing to prove. 

For the inductive case consider the following diagram of IPO squares defining the 
first reaction in the chain 

to -r go 




We need to consider two cases. The first one is where /' is a reactive context (/' € 
{/, Id}). Since reactive contexts are composition-reflecting, then also the IPO context 
/" is reactive. By the definition of bisimilarity, Uo Ui with Ui ~^ dr. By reactivity 
of go, using suitable IPO pasting diagrams, we can prove ^o(^o) d'ui. Now by 

applying the inductive hypothesis to the reduction d'{dr) t'^, we obtain the claim. 

The second case is where /' is a non reactive context (/' = /). Since reactive 
contexts are compositional reflecting, then also the IPO context /" is non reactive 
and therefore, by hypothesis, IPO uniform. Notice that the context Id is an IPO 
context for the term f"{to), by the IPO uniformity of /", Id is an IPO context also 
for f"{uo) and there exist a list context g'{Ji, . . . Ji) —* J s.t. to g'{to, ■ ■ ■ ,to) and 
also Uo — >i g'{uo, . . . ,Uo). Notice that, if the reduction relation is deterministic, two 
terms that reduce one to the other via r transitions are weakly bisimilar. It follows that 
go{to) — ^/ d'g'{to, . . . ,to) t'o and, by IPO pasting, go{uo) -^i d'g'{uo, . . . ,Uo), 
from which we derive the claim. □ 

Remark 3.6. Propositions 13.21 and 13.51 above, about congruence of the weak IPO bisimilar- 
ity, are more related than what they look at first glance. From one side, by exploiting the 
fact that the composition of a non-reactive context with any context gives a non-reactive 
context, one can show that, if the non-reactive IPOs are definable by contexts, then any 
non-reactive context is IPO-uniform. Note that the condition of "definability by context" 
is in general simpler to verify than the one of "IPO-uniformity" , and so we prefer to present 
the given formulation of Proposition 13. 2[ On the other side, it would be possible to extend 
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the notion of "definability by context" to the case of list extension categories, however to 
this aim it would be necessary to present a series of new definitions, necessary to lift the 
IPO construction to the list extension categories. For the sake of simplicity, we prefer to 
avoid the introduction of these further notions. 

4. The Lambda Calculus 

First, we recall the A-calculus syntax together with lazy and cbv reduction strategies and 
observational equivalences. Then, we show how to apply the RPO technique to A-calculus, 
viewed as a context category, and we discuss some problematic issues. 

4.1. SyntEix, Reduction Strategies, Observational Equivalences. 

Definition 4.1 (Syntax). The set of X-terms A is defined by 

(A 3) M ::= x\ MM \ Xx.M , 

where x € Var is an infinite set of variables. Let FV(M) denote the set of free variables in 
M, and let us denote by A° the set of closed A-terms. 

As usual, A-terms are taken up-to a-conversion, and application associates to the left. 
We consider the standard notions of /?-rule and /?y-rule: 

Definition 4.2. 

(i) /?-rule: {Xx.M)N M[N/x]; 

(ii) /?i/-rule: {Xx.M)N -^py M[N/x\, if iV is a variable or a A-abstraction. 

As usual, we denote by =fj and =py the corresponding conversions. 

A reduction strategy on the A-calculus determines, for each term which is not a value, 
a suitable /3-redex appearing in it to be contracted. The lazy and cbv reduction strategies 

are defined on closed A-terms as follows: 

Definition 4.3 (Reduction Strategies). 

(i) The lazy strategy — >iC A^ x A*^ reduces the leftmost /3-redex, not appearing within a 
A-abstraction. Formally, -^i is defined by the rules: 

N^lN' 

{Xx.M)N M[N/x] NP N'P 

(ii) The call by value strategy ^t,C A° x A° reduces the leftmost /3y-redex, not appearing 
within a A-abstraction. Formally, — is defined by the following rules: 

N -^^ N' N -^^ N' 

{Xx.M)V M[V/x] NP -^y N'P {Xx.M)N {Xx.M)N' 
where F is a closed value, i.e., a A-abstraction. 
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We denote by — >* the reflexive and transitive closure of a strategy ^o-, for a G {^w}, by 
Vala^ the set of values, i.e., the set of terms on which the reduction strategy halts (which 
coincides with the set of A-abstractions in both cases), and by M the fact that there 
exists V € Val^ such that M ^%V. 

As we will see in Section 14.21 below, each strategy defines a (deterministic) reactive 
system on A-terms in the sense of Definition 12.11 To this aim, it is useful to notice that 
the above reduction strategies can be alternatively determined by specifying suitable sets of 
reactive contexts (see Remark 14. 51 below), which are subsets of the following unary contexts, 
i.e., contexts with a single hole: 

Definition 4.4 (Unary Contexts). Let P € A. The unary contexts are: 

C[] ::= []|PC[]|C[]P|Ax.C[]. 
The closed unary contexts are the unary contexts with no free variables. 
Remark 4.5. 

(i) The lazy strategy — is the closure of the /3-rule under the reactive contexts, corre- 
sponding to the closed applicative contexts: D[] ::= where P ^ . 

(ii) The cbv strategy — is the closure of the /3y-rule under the following closed reactive 
contexts: D[\ ::= [ ] [ D[ ]P \ {Xx.M)D[ ] , where P, Xx.M G A°. 

Each strategy induces an observational (contextual) equivalence a la Morris on closed terms, 
when we consider programs as black boxes and only observe their "halting properties" . 

Definition 4.6 (cj-observational Equivalence). Let -^a- be a reduction strategy and let 
M,N E A'^. The observational equivalence ~o- is defined by 

M ^„ N iff for any closed unary context C[]. C[M] ]^^^ C[N] . 

The definition of ~(j can be extended to open terms by considering closing (by-value) 
substitutions, i.e., for M,N £ A s.t. FV{M,N) C {xi, . . . ,x„}, we define: 

M^^N iff for all closing (by-value) substitutions P, M[P/x] ^„ N[P/x\ . 

Remark 4.7. Often in the literature, the observational equivalence is defined by considering 
multi-holed contexts. However, it is easy to see that the two notions of observational 
equivalences, obtained by considering just unary or all multi-holed contexts, coincide. 

The problem of reducing the set of contexts in which we need to check the behavior 
of two terms has been widely studied in the literature. In particular, for both strategies 
in Definition 14.31 above, a Context Lemma holds, which allows us to restrict ourselves to 
applicative contexts of the shape [ ]P ([ ]V), where P (V) denotes a list of closed terms 
(values). Let us denote by k,'^^ the observational equivalence which checks the behavior of 
terms only in applicative (by-value) contexts. This admits a coinductive characterization 
as follows: 

Definition 4.8 (Applicative u-bisimilarity) . 
(i) A relation 7e C A° x A° is 
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— an applicative lazy bisimulation if the following holds: 

{M,N) G 7^ ^ (M N i^i) A VP G A°. {MP,NP) G 7^. 

— an applicative cbv bisimulation if the following holds: 

(M, N) en =^ (M ^ Nij^y) A Vy closed value. {MV, NV) G 71. 

(ii) The applicative equivalence i^^^^ is the largest applicative bisimulation. 

The following is a well-known result |A()93[ [EHR92] : 

Lemma 4.9 (Context Lemma). ^^^=^0^^. 

By the Context Lemma, the class of contexts in which we have to check the behavior 
of terms is smaller, however it is still infinite, thus the applicative bisimilarity is infinitely 
branching. In the following, we will study alternative coinductive characterizations of the 
observational equivalences, arising from the application of Leifer-Milner technique. 

4.2. Lambda Calculus as a Reactive System. Both lazy and cbv A-calculus can be 
endowed with a structure of reactive system in the sense of Definition 12. 1^ by considering 
corresponding context categories. 

Definition 4.10 (Lazy, cbv A-reactive Systems). C^, for a G {l,v}, consists of 

• the category whose objects are 0, 1, where the morphisms from to 1 are the closed terms 
(up-to a-equivalence) , the morphisms from 1 to 1 are the unary closed contexts (up-to 
a-equivalence) , and composition is context insertion; 

• the subcategory of reactive contexts is determined by the reactive contexts for the lazy 
and cbv strategy, respectively, presented in Remark 14. 5j 

• the (infinitely many) reaction rules are {\x.M)N — M[N/x\, for all M,N, where — >/3; 
is ^/3-rule, while — >/3„ is — >/3^-rule. 

The above definition is well-posed, in particular the subcategory of reactive contexts is 
composition-reflecting. 

One can easily check that the reactive system has redex RPOs; this fact can be 
proved by rephrasing the corresponding proof for the category of term contexts of |Sew02] . 
Here it is essential the fact that we consider only closed terms and closed contexts. 

Lemma 4.11. The reactive system C^, for a G {l,v}, has redex RPOs. 

The IPO contexts of a closed term for the lazy and cbv reactive systems are summarized 
in the second columns of the tables in Fig. [3l Intuitively, such contexts are minimal for the 
given reduction to fire. Vice versa, contexts different from the ones above are not IPO; e.g. 
C[ ]P, for terms of the shape Ax.M, is not IPO if C[ ] is different from \x.Ci[ ] and [ ], 
because otherwise the reduction can fire already in C[ ]. 

The strong versions of context and IPO bisimilarities are too fine, since they take 
into account reaction steps, and tell apart /3-convertible terms. Trivially, I and /I, where 
/ = Xx.x, are equivalent neither in the context bisimilarity nor in the IPO bisimilarity, since 
[] [] 

/ tA, while // — > (both in the lazy and cbv case). On the other hand, one can easily check 
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Lazy IPO Its's 



term 


IPO contexts 


reactive IPO contexts 


Xx.M 


[]P, (Ax.C[])P, PC[] 


[]P 


{Xx.M)NP 


[], (Ax.C[])P, PC[] 


[] 



Cbv IPO Its's 



term M 


IPO contexts 


reactive IPO contexts 


Ax. Ml 


]P, {Xx.C[ ])P, RC[ ], {Xx.Q)Ci[ ] 


[]P, iXx.Q)[] 


{Xx.Mi)NP 


[], {Xx.C[])P, RC[], {Xx.Q)Ci[] 


[] 



where R is not a value and Ci[Af] is a value. 



Figure 3: IPO contexts for the lazy/cbv Its's. 

that the weak context bisimilarity, where the identity context [ ] is unobservable, equates 
all closed terms. The appropriate notion is that of weak IPO bisimilarity, which, as we will 
see, turns out to capture exactly the lazy and cbv equivalences. 

It is interesting to observe that also the observational equivalence and the applicative 
bisimilarity can be characterized as weak bisimilarities on suitable context Its's. In fact it is 

easy to prove that the observational equivalence ~o- coincides with the weak bisimilarity on 

C'[] C[] 

a restriction of the context Its built on C^, defined by M — > N iff M — >c N and M JJ-g-- 
Similarly, the applicative equivalence can be characterized by considering only applicative 
contexts in the Its. 

In the following we will show that all these Its's induce the same notion of equivalence. 
Moreover, using the results of Section [3l we will show that the set of IPO contexts in the 
weak IPO bisimilarity to be considered can be significantly simplified. Then, from the fact 
that the weak IPO Its is the smallest of the ones above, it follows that it induces the simplest 
proofs that two terms are bisimilar. 

Now, let us denote by ~cr/, for a € {l,v}, the lazy/cbv weak IPO bisimilarity, where 
the identity context is unobservable. In order to prove that is a congruence w.r.t. all 
contexts, we need to consider the category V^, list extension of C^, where the objects are 
finite lists (1, . . . , 1), and an arrow 

71 m 

is a m-tuple of possibly closed multi-holed contexts (Ci, . . . , Cm) with n holes all together. 
Multi-holed contexts are defined by 

C[] ::= []\P\C[]C[]\Xx.C[]. 

Then, in the lazy case one can show that any closed multi-holed context either is IPO 
uniform or it is of the shape [ ]Ci[ ] . . . Cfc[ ] with the first hole reactive. Namely, if C[ ] is of 
the shape [ ]Ci [ ] . . . Cfc[ ], then clearly the first hole is reactive. Otherwise, it is of the shape 
PCi[ ] . . .Ck[] or {Xx.CqI ])Ci[ ] . . . Cfc[ ]. In the first case, the reduction (if any) involves 
only P or at most PCi[ ], where Ci[ ] together with the term put in the holes, plays only a 
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passive role as argument. In the latter case, since the term put in the holes is closed, again 
it will be not affected by the substitution induced by the reduction. Similarly, for the cbv 
case, all the multi-holed contexts are IPO uniform, apart from the contexts ranging on the 
following grammar, which have a reactive hole: 

D[] ::= []\D[]C[]\{\x.C[])D[], 

where C is a closed multi- holed context. Moreover, the reduction relation is obviously 
deterministic. Thus, by applying Proposition 13.51 we have: 

Corollary 4.12. 

(i) For all M,N E A*^, for any closed unary context C[], 

M N =^ C[M] ^^j C[N] . 

(ii) Moreover 

where ~o-_r denotes the weak IPO bisimilarity where only reactive contexts are consid- 
ered (see the third columns in the tables of Fig. l^. 

Now, we are left to prove that the IPO bisimilarity coincides with the original observational 
equivalence. Notice that, in the above proposition, we also provide a new alternative proof 
of the Context Lemma for the lazy case. 

Proposition 4.13. ~i = = and ~^ = ^vl- 

Proof. For the lazy case, we proceed by proving the following chain of inclusions: 

C ^^PP C C C . (4.1) 

The first inclusion, C ss"^^, holds by definition. The third inclusion, C ss^/, 

follows by Corollary 14. 12( ii). The others are proved as follows: 

• ^1^^^ C ^ifi. We prove that Ri"^^ is a "weak IPO reactive bisimulation" . Let M k,'^^^ N. 

Assume M — >/ M' in the IPO reactive system. By case analysis on M and C[ ] we show 

that 3iV'. N' A M' ^f^^ N' . 

If M EE (Ax.Mi)QP and C[ ] ee [ ], then ^ A^, M' =^ M '^^^^ N, hence closedness 
of Ri"^^ under /3-conversion establishes the claim. 

If M EE Xx.Mi and C[] = [ ]P, then, since M '^1^^ N, 3Xx.Ni. N U Xx.Ni A^'. 
Then M' =^ MP A^P =p N' , and closedness of ~i under /3-conversion establishes 
the claim. 

• ^ ^i- Let M A^. We have to show that, for any unary closed context C[], 
C[M] ^ ^ C[N] ^. From M N, by Corollary EZJ^i) , we have C[M] C[N]. 

Now assume that C[M] Jl^, then there exists M' such that C[M] M', hence also there 

exists N' such that C[N] A^', thus C[N] 
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The above argument provides a new proof of the Context Lemma. 

For the cbv case, considering the apphcative equivalence ^v^^ does not help, but one 
can prove directly: 

^ '^vR ^ ~vl ^ (4.2) 

• C One can easily check that ~„ is a "weak IPO reactive bisimulation" , using 
the fact that k,^ is closed under /3-reduction. 

• ~vR ^ ~i>/- Immediate by Corollary [4.12r ii). 

• ^vl ^ Let M N. We have to show that, for any unary context C[ ], 
C[M] C[N] From M N, by Corollary Ol^i) , we have C[M] C[N]. 

Now assume that C[M] J|^, then there exists M' such that C[-/Vf] M', hence also there 

exists N' such that C[N] N' , thus C[N] □ 

Remark 4.14. Corollary 14. 12l fii) allows us to reduce the set of IPO contexts to be consid- 
ered in the IPO bisimilarities. For the lazy case, only applicative contexts can be considered 
(see the first table in Figure [3j), while for the cbv case, the set of reactive IPO contexts is 
larger (see the second table in Figure [3]). However, also for the cbv case, one can prove that 
applicative (by-value) IPO contexts are sufficient. We omit the details. 

Proposition 14.131 above gives us interesting characterizations of lazy and cbv observa- 
tional equivalences, in terms of Its's where the labels are significantly reduced. However, 
such Its's (and bisimilarities) are still infinitely branching, e.g. Xx.M — >/, for all P G A*^. 
This is due to the fact that the context categories underlying the reactive systems C^^ and 

allow only for a ground representation of the /3-rule through infinitely many ground 
rules. In order to overcome this problem, one should look for alternative categories which 
allow for a parametric representation of the /3-rule as {Xx.X)Y X[Y/x], where X,Y 
are parameters. To this aim, we introduce the category of second-order term contexts (see 
Section E] below). However, as we will see, this approach works only if the reaction rules 
are "local", that is, they do not act on the whole term, but only locally. In particular, the 
operation of substitution on the A-calculus is not local and thus it is not describable by 
a finite set of reaction rules. To avoid this problem, in the following section we consider 
encodings of the A-calculus into Combinatory Logic (CL) endowed with suitable strategies 
and equivalences, which turn out to correspond to lazy and cbv equivalences. 



5. Combinatory Logic 

In this section, we focus on Combinatory Logic |HS86j with Curry's combinators K, S, 
and we study its relationships with the A-calculus endowed with lazy and cbv reduction 
strategies. An interesting result that we prove is that we can define suitable reduction 
strategies on CL-terms, inducing observational equivalences which correspond to lazy and 
cbv equivalences on A-calculus. As a consequence, we can safely shift our attention from the 
reactive system of A-calculus to the simpler reactive system of CL. In this section, we apply 
Leifer-Milner construction to CL viewed as a (standard) context category, and we study 
weak versions of context and IPO bisimilarities. Our main result is that we can recover 
lazy and cbv observational equivalences as weak IPO equivalences on CL*, a variant of 
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standard CL. Here the approach is first-order, thus the IPO equivalences are stiU infinitely 
branching. However, the results in this section are both interesting in themselves, and 
useful for our subsequent investigation of Section [U where CL is viewed as a second-order 
rewriting system, and a characterization of the lazy observational equivalence as a finitely 
branching IPO bisimilarity is given. 

In |Sew02| . a construction, similar to Leifer-Milner construction, has been applied to 
the Combinary Logic case. However, in that paper, it has been left open the question of 
whether the weak bisimilarity on the derived LTS is a congruence. In this paper, using 
Proposition 13.51 we can positively answer that question. 

Definition 5.1 (Combinatory Terms). The set of combinatory terms is defined by: 

{CL 3) M ::= x | K | S | MM , 
where K, S are combinators. Let CL^ denote the set of closed CL-terms. 



5.1. Correspondence with the A-calculus. Let A(K,S) denote the set of A-terms built 
over constants K,S. The following is a well-known encoding: 

Definition 5.2 (A-encoding). Let T : A(K, S) — > CL be the transformation defined as 
follows: 

T{x) = X T{C) = C if C G {K, S} 

T{MN) = T{M)T{N) T{Xx.MN) = ST{Xx.M)T{Xx.N) 

T{Xx.x) = SKK T{Xx.Xy.M) = T{Xx.T{Xy.M)) 

T{Xx.y) = Ky T{Xx.C) = KT{C) if C G {K, S} 

In particular, if we restrict the domain of T to A, we get an encoding of A-terms into CL. 

Vice versa, there is a natural embedding of CL into the A-calculus £ : CL — > A: 

^(K) = Xxy.x £{S) = Xxyz.{xz){yz) £{x) = x £{MN) = £{M)£{N) 
The following lemma holds: 
Lemma 5.3. For all M e A, £{T{M)) M, for a E {/3,/3y}. 

Proof. First, one can easily prove that, if M is A-free, then £T{Xx.M) Xx.M (by 
induction on M). Then, using the fact that T{M) is A-free for all M, by definition of T, 
one gets that T^(M) = T{M) for all M. Finally, we are ready to prove the claim in its full 
generality by induction on M. The only non-trivial case is when M = Xx.Xy.N. Then we 
have £T{Xx.Xy.N) = £T{Xx.T{Xy.N)), where T{Xy.N) = PQ is A-free. Then 

£T{Xx.Xy.N) = £T{Xx.PQ) = £{S)£T{Xx.P)£T{Xx.Q) 

= {Xxyz.{xz){yz))£T{Xx.P)£T{Xx.Q) 

=o- {Xxyz.{xz)(yz))Xx.PXx.Q , since PQ is A-free, 

=CT Xx.PQ =cr Xx.£T{PQ) , since PQ is A-free, 

= Xx.£rT{Xy.N) = Xx.£T{Xy.N) , since = T, 

=o- Xx.Xy.N , by induction hypothesis. □ 
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5.1.1. Lazy/cbv observational equivalence on CL. Usually, the set of combinatory terms are 
endowed with the following reaction rules: 

KM AT ^ M SMNP {MP){NP) 

We will also consider a cbv version of the above rules, reducing CL redexes only when the 
arguments are values, i.e., terms on the following grammar: 

F ::= K I S I Ky |SF | SVV . 

The cbv rules are the following: 

KV1V2 ^ Vi SV1V2V3 ^ (ViVs) {V2V3) 

Definition 5.4 (Lazy/cbv Reduction Strategy on CL). 

(i) The lazy reduction strategy — C CL^ x CL^ reduces the leftmost outermost CL- 
redex. Formally: 

M -^i M' 

SM1M2M3 (MiM3)(M2M3) KM1M2 Ml MP M'P 

(ii) The cbv strategy ^„ C CL° x CL° is defined by 

Ml M( 

SFiFaV^s -^v {ViV^){V2V^) KFiFs -^v KMi KM( 

M2 M^ Ml M[ M2 M^ 

KV1M2 -^v KViM^ SMi SM{ SV1M2 SViM!^ 

Ms ^„ M^ M M' 

SF1F2M3 SViV2M^ MP -^^ M'P 

where Vi, V2, V3 are values. 

Definition 5.5 (Unary Contexts on CL). The set of unary contexts on CL is defined by 

C[] ::= []\C[]P\PC[]. 

Alternatively we could define the lazy strategy -^i as the closure of the standard CL- 
reaction rules under the following reactive contexts (which coincide with the applicative 
ones) : 

D[] ::= []\D[]P. 

Similarly, we could define the cbv strategy ^„ as the closure of the cbv reaction rules under 
the following reactive contexts: 

D[] ::= [ ] I D[ ]P \ KD[ ] \ KVD[ ] \ SD[] \ SVD[ ] | SViV2D[ ]. 

Let lo- denote the convergence relation on CL, for a G {l,v}. 

Definition 5.6 (Lazy/cbv Equivalence on CL). 
(i) A relation 7^ C CL° x CL^ is a 
— CL lazy bisimulation if: 

(M, N) €n =^ {M ii^ N ii) A VP € CL°. (MP, NP) € U . 
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— CL cbv bisimulation if: 

(M, N) en ^ {M U ^ N U) A \/ closed value V G CL°. {MV, NV) G 7^ . 

(ii) Let ~o- C CL*^ X CL*^ be the largest CL lazy/cbv bisimulation. 

(iii) Let ~o- Q CL x CL denote the extension of ~(j to open terms defined by: for M, N G 
CL s.t. FV{M, N) C {xi, . . . , Xn}, MSi^N iff for all closing (by- value) substitutions 
P, M[P/x] N[P/x]. 

Notice that we use two different symbols for equivalences (~ and ~), in this way we distin- 
guish the equivalence relation on A-terms from the corresponding relation on CL. 
The following theorem is interesting per se: 

Theorem 5.7. For aUM,N e A, MS^iV T{M)^^T{N) . 

Proof of Theorem 15.71 We carry out the proof of the above theorem for the lazy case, 
the proof for the cbv case being similar. 

Lemma 5.8. 

(i) For all M G CL^, M U ^ 8{M) 

(ii) For all M € , M % T{M) U . 

Proof. 

(i) By definition of the lazy strategies on A-terms and on CL-terms. 

(ii) (=^>) Let M Jj.;. Then, since by Lemma ISTSl 8(T(M)) =j3 M, and is closed under 
/3-conversion, we have also f (T(M)) J|;. Thus, by (i), T{M) li. 

(^) Let r(M) i;. By (i), £{T{M)) by Lemma[01 M =p £{T{M)), thus M ij.i.\J 
Lemma 5.9. For all M,N £ CL^, if £{M) =p £{N), then M ~, A^. 

Proof The proof follows from the fact that 7^ = {{M,N) G CL^ j £{M) =p £{N)} is a 
CL lazy bisimulation. Namely M [i iff N J,;, because, by Lemma ISTSl i) . M [i iff £{M) li 
and A'' [i iff £{N) [i, and Ri"^^ is closed under /3-conversion. Moreover, for any P G CL^, 
{MP, NP) G n, since £{MP) = £{M)£{P) =p £{N)£{P) = £{NP). □ 

Lemma 5.10. VP G CL°, P -/ T{£{P)). 

Proof We prove that 7^ = {{PR,T{£{P))R) \ P,R e CL°} is a bisimulation. To this 
aim, it is sufficient to prove that, for all P,R, PR li <^ T{£{P))R JJ-/. By Lemma |5.8[ 
PR ii ^ £{PR) ^i. Now £{PK) = £{P)£{R) =^{£oTo £{P))£{R) = £{{T o £{P))R). 
Finally, by Lemma ESI £{{T o £iP))R h ^ T{£{P))R j/. □ 

Lemma 5.11. Let M G A and let P be closed such that M[P /x] G A'^, then 
r{M[P/x])^i T{M)[T{P)/x] . 

Proof. By Lemma EH it is sufficient to show that £{T{M[P/x\)) =p £{T{M)[T{P)/x]). 
Now £{T{M[P / x])) =p M[P/x\, by Lemma |5.3[ On the other hand, from the definition 
of £, one can prove by induction that £{T{M)[T{P)/x\) = £T{M)[£T{P)/x], which, by 
Lemma E2\ =p M[P/x]. □ 
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Now we proceed to prove Theorem 15.71 (=^). Assuming M^iN, we have to prove 
that, for all closing P, T{M)[P/x] ~i T{N)[P/x]. From M^iN it follows M[S{P)/x] 
N[£{P)/x]. By Lemmata 15.81 15.101 using the fact that ~; is a congruence, we have 
T{M[£{P)/x]) ~z T{N[£{P)/x]). By Lemma EUl T{M)[T£{P)/x\ T{N)[T£{P)/x\, 
hence by Lemma 15. IH using the fact that ~z is a congruence, we have T(M)[P/x] ~z 
T{N)[P/x\. 

In order to prove Theorem 15.71 (<^=), assume T{M)c^iT{N). We have to prove that, 
for all closing P, M[P/x] N[P/x]. From T{M)^iT{N) it follows T{M)[T{P)/x] ~z 
T{N)[T{P)/x]. From Lemma EZIIl we have T{M[P/x]) ~z T(iV[P/x]). By Lemma ESI 
we have M[P/x] N[P/x]. 

5.2. The First-order Approach: CL as a Context Category. We endow CL with a 
structure of reactive system in the sense of [LMOOj , by considering the context category of 
closed unary contexts: 

Definition 5.12 (Lazy, cbv CL Reactive Systems). C^, for a € {l,v}, consists of: 

• the context category whose objects are 0, 1, where the morphisms from to 1 are the 
closed terms, the morphisms from 1 to 1 are the closed unary contexts, and composition 
is context substitution; 

• the subcategory of reactive contexts is determined by the reactive contexts for the lazy 
and cbv strategy, respectively, presented in Definition 15. 4t 

• the reaction rules are the standard CL reduction rules for the lazy case, and the cbv 
reduction rules for the cbv case. 

Lemma 5.13. The reactive systems C\ have redex RPOs. 

One can easily check that the IPO contexts are the following. 

• Lazy. The IPO contexts for a given term M are: 

— [ ]-P, where P has the minimal length for the top-level reaction of M to fire, 

— KC[]Pi, KPiC[], KPiQC[],foranyC7[],Q,A, 

— SC7[]PiP2, SPiC[]P2, SPiP2C[], SPiP2QC[],foranyPi,P2,C[],Q. 

• Cbv. 

For M not a value, the following contexts are IPOs: 

— [], 

For M value, the following contexts are IPOs: 

— [ ]Vi . . . Vi, where i is the minimum number of arguments necessary for the top-level 
reaction of M to fire, 

— [ ]Vi . . . ViP, where P is not a value, and i, possibly 0, is less than the minimum number 
of arguments necessary for the top-level reaction of M to fire, 

— VC[ ]Vi . . .Vi where V and C[M] are values and z + 1 is the minimum number of 
arguments necessary for the top-level reaction of V to fire, in more detail: KC[ ]V, 
KVC[ ], SC[ ]ViV2, SViC[ ]V2, SViV2C[ ], 

— VC[ ]Vi . . . ViP, where V and C[M] are values, P is not a value, and z + 1 is less than 
the minimum number of arguments necessary for the top-level reaction of V to fire, in 
more detail: KC[ ]P, SC[ ]P, SC[ ]ViP, SViC[ ]P. 
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For any term M, the following contexts are IPOs: 

— PC[ ], where P is not a value and C[ ] is any context. 
For any of the above contexts there is a reduction rule which applies, and the context 
is minimal for the given reduction to fire. By case analysis, one can show that all the other 
contexts are not IPO contexts. 

The strong versions of context and IPO bisimilarities are too fine, since, as in the A- 
calculus case, they take into account reduction steps, and tell apart /3-convertible terms. 
Thus we consider weak variants of such equivalences, where the identity context [ ] is un- 
observable. Weak context bisimilarity is too coarse, since it equates all terms. However, we 
will prove that the weak IPO bisimilarity "almost" coincides with the lazy/cbv equivalence. 
Moreover, we will show how to recover the exact correspondence by considering a suitable 
variant of CL. 

First of all, let for a G {^u}, denote the lazy/cbv weak IPO bisimilarity obtained 
by considering the identity context as unobservable. Similarly to the case of the A-calculus, 
we can define a list extension category by taking the category of multi- holed contexts. In 
this category all contexts with no reactive indexes are IPO uniform. In the lazy case, the 
contexts with a reactive index are of the shape [ ]Ci[ ] . . . Cfc[ ] (with the leftmost hole being 
reactive), and the remaining ones have not reactive indexes and are IPO uniform. For the 
cbv case, one can show that the multi-holed contexts with a reactive index are given by the 
grammar: 

D[] ■■= []\ D[ ]C[ ]\KD[]\ KVD[ ]\SD[]\ SVD[ ] \ SViV2D[ ] , 

where C[ ] is any closed multi-holed context. 
Thus, by Proposition 13. 5f i). we have: 

Proposition 5.14. For all M,N € CL^, for any closed unary context C[ ], 
M N =^ C[M] C[N] . 

The rest of this section is devoted to compare the lazy/cbv weak IPO bisimilarity 
with the lazy/cbv equivalence on CL ~o- defined in Definition 15.61 The following lemma 
can be easily proved by coinduction, using Proposition 15.141 

Lemma 5.15. ~ct/ C 

Proof. We prove that is a lazy/cbv bisimulation on CL. Let M N. If M then 
also J,(j, since a convergent term has different IPO-transitions from a divergent term. We 
are left to prove that for all P, MP NP. But this follows from Proposition 15.141 □ 

However, the converse inclusion ~(j C does not hold, since for instance K 

S(KK)(SKK), because, e.g. for the lazy case, for all P, S(KK)(SKK)P ^* KP. But 

\]v 

K S(KK)(SKK). Namely S(KK)(SKK) while Ky^j. The problem, which was 

already noticed in [Sew02j , arises since the equivalence tells apart terms whose top-level 
combinators expect a different number of arguments to reduce. In order to overcome this 
problem, we consider an extended calculus, CL*, where the combinators K and S become 
unary, at the price of adding new intermediate combinators and intermediate reductions 
(the reactive contexts are the ones in Definition I5.12p . 
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Definition 5.16. The CL* lazy combinatory calculus is defined by 

• Terms: 

M ::= X I K [ S I K'M \ S'M \ S"MN \ MN 

where K, K', S, S', S" are combinators. 

• Rules: 

KM ^ K'M K'MA^ ^ M 

SM ^ S'M S'MN S"MN S"MNP {MP){NP) 
The CL* cbv combinatory calculus is defined by 

• Terms: 

M ::= X I K I S I MN \ K'V \ S'V \ S"VV 
Values: 

V ::= K I K'V \ S \ S'V \ S"VV 

where K, K', S, S', S" are combinators. 

• Rules: 

KVi K'Vi K'ViV2 Vi 

SVi ^ S'Vi S'ViV2 ^ S"ViV2 S"ViV2V3 ^ (yil^3)C^^2^3) 

Notice that the calculus in the above definition is well-defined, since the set of terms is 
closed under the reaction rules. One can define lazy/cbv reduction strategies on CL* as in 
Definition 15.41 or as the closures of the reaction rules under the following reactive contexts: 

Definition 5.17 (CL* Reactive Contexts). 

• Lazy. D[] ::= [ ] | D[ ]P . 

• Cbv. D[] ::= [] [ D[]P \ VD[]. 

Let ~* be the lazy/cbv equivalence defined on CL*, similarly as in Definition 15.61 for CL. 
There is a trivial embedding of CL-terms into CL*. Moreover, one can easily check that, 
when restricted to terms of CL, ~* coincides with 

Analogously to the CL case, we define the reactive system over CL*. In the context 
category, the unary closed contexts are defined by the grammar 

C[] ::= []\C[]M\MC[] 

where M is a closed term. Notice that, under the above definition, expressions like K'[ ] 
do not represent unary closed context. In defining the IPO transitions, it is important to 
observe that C[M] is a value iff M is a value and C[ ] is the identity context [ ]. Let us 
denote by ~*j- the weak IPO bisimilarity obtained by considering the lazy/cbv reactive 
system over CL*. Since CL*-terms expect at most one argument, the IPO contexts for CL* 
are simpler than the ones for CL, and they are summarized in Figure [H 

Similarly to the previous case, one can consider the multi-holed contexts category as 
a list extension category. In this category all contexts are either IPO uniform or have a 
reactive index. Moreover, the reduction relation is deterministic. Thus Proposition 13.51 
applies and we have: 
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Lazy IPO Its's on CL* 



term M 


IPO contexts 


reactive IPO contexts 


M value 


[ ]P, PC[ ] 




M not a value 







Cbv IPO Its's on CL* 



term M 


IPO contexts 


reactive IPO contexts 


M value 


[]P,RC[IV[] 




M not a value 


[ I RC[ ] 





where R is not a value, y is a value, C[ ] is a generic unary context. 
Figure 4: IPO contexts for the lazy/cbv Its's on CL*. 

Proposition 5.18. 

(i) The equivalence 2^*^- is a congruence w.r.t. unary contexts. 

(ii) — *j = — ^/j, where — *^ denotes the IPO bisimilarity where only reactive IPO contexts 
are considered. 

By Proposition I5.18l fii) above, the weak IPO equivalence can be significantly simplified. 
Namely, in the lazy case, we obtain the weak IPO bisimilarity —m, where only applicative 
IPO contexts are considered (see Figure H]). In the cbv case, Proposition 15.181 allows us to 
reduce ourselves to contexts of the shape [ ], [ ]P, ^[ ] (see Figure H]). However, one can 
prove that also in this case we can consider only applicative by-value contexts. We skip the 
details of such proof. 

Moreover, we have K ~*j S(KK)(SKK). More in general, the weak IPO bisimilarity 
coincides with the lazy/cbv equivalence on CL: 

Theorem 5.19. For all M,N e CL°, M N M ~^ N. 

Proof. (C) One can show that C ~* by coinduction, as in the proof of Lemma 15.151 
by showing that is a bisimulation on CL*, also using Proposition 15.181 Then, since ~* 
coincides with ~(j on CL-terms, we obtain the claim. 

(5) By coinduction, showing that ~(j is a weak IPO bisimulation on CL*. □ 

As a consequence of Theorem 15 . 71 and Theorem 15. 191 above, we can recover the lazy/cbv 
observational equivalence on A-terms as weak IPO bisimilarity on CL*. 

Proposition 5.20. For all M, iV € A°, M N T{M) T{N). 

However, such notions of weak IPO bisimilarities still suffer of the problem of being 
infinitely branching, since the IPO contexts are [ ], []P for the lazy case, and [ ], []V for 
the cbv case, for ah P,V £ {CL*f. 

This problem will be solved in the next section, where we introduce the notion of 
second-order context category, and we endow CL* with such a structure. 
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6. Second-order Term Contexts 

The definition of term context category [LMOOj can be generalized to a definition of 
second-order term context category. The generahzation is obtained by extending the term 
syntax with function (second-order) variables, that is, variables not standing for terms but 
instead for functions on terms. The formal definition is the following 

Definition 6.1 (Category of Second-order Term Contexts). Let S be a signature for a term 
language. The category of second-order term contexts over S is defined by: objects are finite 
lists of naturals (ni, . . . , n^), an arrow (mi, . . . , rrih) — > (ni, . . . , nfc) is a k-tuple (ti, . . . , i^), 
where the term ti is defined over the signature E U , . . . , F™'' } U . . . , Xi,„ J, where 
FP'' is a function variable of arity mj, Xij is a ground variable. The category of second- 
order linear term contexts is the subcategory whose arrows are n-tuples of terms, satisfying 
the condition that the n-tuples have to contain exactly one use of each function variable 
-F/"* and ground variable Xij. The category of second-order function-linear term contexts, 
r2 (E), is the subcategory whose arrows are n-tuples of terms, satisfying the condition that 
the n-tuples have to contain exactly one use of each function variable moreover no 

function variable appears inside the argument of another function variable. 

Remark. Notice that the above definition of second-order linear term contexts is different 
from that given in the conference version of the present paper, |DHL08j . The modification 
was necessary because the original definition was incorrect (second-order linear contexts 
were not closed by composition). 

In the following we are going to use just a subcategory of the category of second-order 
function-linear term contexts, however, at this point, we prefer to present the original idea 
of second-order term contexts in its full generality. 

Example 6.2. Given the signature of natural numbers {0, S, -\-}, examples of second-order 
linear contexts representing arrows in (2, 0) (0, 2) are: 

(F0(),F2(S(X2,2)+X2,i)), (F2(0,0),F0() + (X2,i+X2,2)), (Ff (0, ()) , (^2,1 +^2,2)) 

Note that the last context is not function-linear. Examples of second-order function- linear 
contexts are: 

{F^0,Ff{X2,2,0)), {F^{0,0),F^{)+X2,2+X2,2), {FH0,0)+F^0,X2,2+X2,2) 

None of the above contexts is linear. Examples of second-order contexts that are neither 
function-linear nor linear are: 

(0,F2(Xi,2,X2,2)), (Ff (0,F0()),X2,2), (^2(0,0), (F^O +^1,2) + (F^O + ^2,2)) 

Intuitively, an arrow in (2, 0) (0, 2) represents a pair of contexts containing two holes 
F^jFg, where Ff is a hole that must be filled by a term representing a function with two 
arguments while Fg is a hole that must be filled by a term representing function with no 
arguments, i.e., a ground term. The first context in the pair (2,0) — s- (0,2) represents 
a function with no arguments, while the second context represent a function with two 
arguments ^2,1, X2, 2- 

One can check that the standard category of term contexts over E coincides with the 
subcategory whose objects are the lists containing only copies of the natural number 0; in 
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fact this subcategory uses function variables with no arguments and the ground variables 
do not appear. 

The identity arrow on the object (ni, . . . , Uk) is: 

(Ffi (^1,1, • ■■Xi^ni), ■ ■ • J-^fe* {^k,l, ■ ■■Xk,nk)) 

In order to define composition in the categories of second-order term contexts, it is 
convenient to consider the A-closure of the tuple of terms representing arrows and to define 
arrow composition through /3-reduction. 

The A-closure of a term t on the signature S U . . • U {Xi, . . . is 

AF["^ . . . Fj^^.XXi . . . Xn-t, which, for brevity, can also be written as XF.XX.t. In general, 
given a second-order context {ti, . . . ,tk) ■ (mi, . . . , nih) (ni, . . . , n^), we consider the 
A-closurc: XF.{XXi.ti, . . . , AX^.tfe)- The composition between the morphisms: 

XF.{XXi.si,. . . , AXfc.Sfc) : {h, . . . ,lh) ^ (mi, . . . ,mfe) 

XG.{XYi.ti, . . .,XYj.tj) : (mi, . . . ,mfe) (ni, . . . ,nj) 
is the /3-normal form of the A-expression 

XF.{XG.{XYi.ti, . . . ,XYj.t^)){XXi.si, . . . ,XXk.Sk) : {li,...,lh) ^ {ni,...,nj) 
To give an example, the composition between 

XF.XXi.F{Xi,0) : (2) ^ (1) and XG.XYiY2.G{S{Yi)) + Y2 : (1) ^ (2) 
is given by: 

XF.iXG.XYiY2.G{S(Yi)) + Y2)iXXi.FiXi,0)) 
-^f3XF.XYiY2.{XXi.F{Xi,0)){S{Yi)))+Y2) 
^pXF.XYiY2.F{{S{Yi),0)+Y2) : (2) ^ (2) . 
In other words, the composition is given by a j-tuplc of expressions ti in which every 
function variable Gi is substituted by the corresponding expression si, with the ground 
variables of si substituted by the corresponding parameters of Gi in ti. 

Note that the identity morphism is defined as a A-term implementing the identity 
function, while composition on morphisms is defined by the function composition in the 
A-setting. Given this correspondence, it is easy to prove that the categorical properties for 
the identity hold, while the associativity of composition essentially follows from the unicity 
of the normal form. 

Finally one need to prove that composition preserve linearity and function-linearity. 
For what concerns linearity, it is a well-known result that linear A-terms are closed by 
reduction. Prom this fact one can immediately prove that second-order linear contexts are 
closed by composition. 

Preservation of function-linearity can be proved similarly. First we generalize the notion 
of function-linearity to A-terms stating that a function-linear A-term is a typed lambda-term 
with constants, where 

• all the variables and constants have either a ground type or a first-order function type; 
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• each bound function variable (e.g. F) appears exactly once in the term, and only inside 
the arguments of constants (e.g. S{F{0) +0), or inside the arguments of A-expressions 
having a second-order function type (e.g. {XGXY.G{Y) + Y){XX.F{X + 5(0)))). That 
is, no function variable appears inside the argument of an expression that has first order 
function type and is not a constant (e.g. G{S{F{0)) + 0) and (XX.X + X){F{Q))). 

It is straightforward to prove that function-linear A-terms are closed by /3-reduction and 
that, given two function-linear second-order contexts, the term, whose /3-normal form defines 
composition, is a function-linear A-term. From this the claim follows. 

The main general result on second-order term contexts is the following: 

Proposition 6.3. For any signature S, in the category of second-order (linear) (function- 
linear) term contexts over S, any commuting square, having as initial vertex the empty list 
e, has an RPO. 

Proof. First we present the proof for the special case useful in this paper, namely we consider 
the restricted category containing as objects the lists with at most one element. Given 
two arrows with domain the empty list: ti : e ^ (ni) and t2 : e ^ {1^2), and two 
arrows si : (ni) (m), S2 ■ (^2) {m) completing ti and t2 into a commuting square 
{sioti = sioti : e {fn)), the corresponding RPO for this commuting square is inductively 
defined on the structures of si, S2- There are several cases to consider: 

(i) si = ci(si^i, . . . , si^fcj and S2 = £2(52,1, S2,fc2)) with ci,C2 function symbols in 
the signature S. Necessarily ci = 02 (and ki = ^2). We have to consider in which 
subterms of si and S2 the function variables, F"^ and F^^, appear. If F^^^ and 
appear in corresponding subterms, that is, there is an i such that all F"^ appears 
in si^i and all F^^ in S2,i, then we have that si^i and S2,i, together with ^1,^2, form 
a commuting square, and the RPO, inductively defined, for this second commuting 
square, immediately induces the RPO for si and S2- The subcase where F"^ and F^^ 
do not appear in corresponding subterms is treated at point (iii). 

(ii) si = F"i(si,i, . . . , si,„J and S2 = F^2(s2,i, . . . , 52,712), and, for the general case, F^^ , 
F"^ not appearing in the subterms Sh,i- In this case, we have that 

/^l,ni] =i2[s2i/^2 1X2 

that is, there is a unifier i.e., a substitution making ti and t2 equal. Consider the most 
general unifier (mgu) for ti and t2, this is given by tuples of terms, s'^ i, . . . , s'l^^^ and 

4,1, • • • >4,n2' ^^^^ *^at *lKi/^l,l, • • • ,s'l,ni/^l,ni] = *2 [^i /^2,1 , • • • , ^.na /^^S.na] • 

^r(4,i> • • • > 4,„i) ■■ (^1) ^ K) and Ff (s'2_i, . . . , 4^„J : (na) ^ (m') complete t, 
and t2 into a commuting square that is also an RPO, in fact any other pair of arrows 
completing ti and t2 into a commuting square and factorizing the original one needs 
to be oftheformFi"n<i,---,<nJ ■ (ni) - {m") and F"^ (4^, . . . , 4„J : (m) ^ 
(m"), with the two sequences (s'/ j) and (s2i) defining a unifier for ^1,^2- The unique 
arrow factorizing the two commuting squares is F"^' (s'(', . . . s'^,), where {s'-') is given 
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by the mgu property. 

^r(<i,...,<„,) 



^2 \^2,l'---'^2,n2) 




(iii) In this point we consider all the remaining cases, that is, where: si = ci(si^i, . . . , si,jki), 
S2 = C2(s2,i, . . . , 52,^2) aiid either F^^ and do not appear in corresponding sub- 
terms, or ci = or C2 = -F^^. Let us consider the term s'l obtained from si by 
substituting any maximal subterm So not containing F^^ by a ground variable Xg^. 

For example, if si = ci(si,i, C2(.si,2,i, i^"^ (si,2,2,i, ■51,2,2,2), 51,2,3)) then s'^ is the term 
ci(Xs^_j,C2(Xsj 2_j,F"i(Xsj 2_2,n-'^si,2,2,2)>-'^si,2,3))> ^ud aualogously for the term S2- 
Let s'/ = s'l oti, and s'/ = s'r,ot2- Now we have that: s'/fs, /Xg ^ , . . . , s, , ^ /Xs - 1 = 

Soiso r-ZXc s, - /Xc _ 1 that is, there exists a unifier for s'/ and s,, we can 

consider the most general unifier, given by a pair tuples of terms s' 

1,(1 i,tmi 

and Sr, i' ..... S-, . By repeating the arguments used at point (ii), we have that 
s'lW JXs s' ~ iXs . ] and s'o\s' ./X, , , . . . , s' . - /X^ , ] form an RPO. 

The proof for the general case is now almost immediate. The RPO for the square 

(mi, . ..mk) 




can be obtained by combining the RPO's for the k diagrams 



for 1 < i < k 




(n2,i,...n2j2) 

(t2,l,...,t2,J2> 



into a sequence. In turn, the RPO for these diagrams can be obtained by essentially 
repeating the construction presented for the unary case. Finally, it is immediate to prove 
that the presented construction preserve linearity and function-linearity of arrows. □ 
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The above proposition holds also for the case of linear second-order contexts and the 
prove remains almost the same. 

6.1. CL* as Second-order Rew^riting System. In this section, we consider the second- 
order context category for the combinatory calculus CL* and we show that the weak 
IPO lazy bisimilarity thus obtained coincides with the lazy observational equivalence on 
A-calculus, while for the cbv case we get a finer equivalence. Interestingly, the second-order 
open bisimilarity gives a uniform characterization also on open terms. 

Note that the terms of CL are defined by the signature T,cl = {K, S, app}, where app 
is the binary operation of application that is usually omitted. So the term SKK actually 
stands for app(app(S, K), K). 

First we deal with the lazy case, then we will sketch also the cbv case. 

6.1.1. The Lazy Second- order Reactive System. 

Definition 6.4 (Lazy Second-order Reactive System on CL*). The lazy second-order reac- 
tive system Cf* consists of: 

• the function-linear category whose objects are the lists with at most one element, and 
whose arrows e (n) are the terms of CL* with, at most, n (first order) metavariables, 

M" ::= Xi I . . . |X„ I K I S I K'M" | S'Af" | S"M"M" | M'^M" 

and whose arrows (m) — s- (n) are the second-order contexts defined by: 

(Cm,n .._ F{M^,...,M^) I M"C™'" I C"^'"M" 

• the reactive contexts are all the second-order applicative contexts of the shape 

F(Mr,...,M;^)iVr...iV,«; 

• the reaction rules are 

KXi K'Xi K'XiX2 Xi 

SXi S'Xi S'XiX2 S"XiX2 S" XiX2X-^ {XiX^){X2X^) 
where KXi, SXi : e ^ (1), K'XiX2, S'XiXa : e ^ (2) and S"XiX2X3 : e ^ (3). 

Second-order contexts as defined above can be represented by C[F{Mi, . . . , Mm)], where C[ ] 
is a unary first-order context on CL* (with metavariables). To maintain the notation for con- 
texts used in Sections 4, 5, in the sequel a second-order context C[F(Mi, . . . , Mm)] ■ (m) — > 
(n) will be more conveniently written as C[ ]$, where is a substitution s.t. 0(Xi) = Mi for 

all i = l,...,m, moreover we write M M' iff C[M0] M'. Given Proposition 16.31 
and the underlined RPOs construction, we have: 

Corollary 6.5. The reactive system Cf* has redex RPOs. 
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term M 


IPO contexts 


reactive IPO contexts 


X 


[]iAY/X},[]\A/X}Y, Am[]0 


[ ]iAY/X}, [ ]{A/X}Y 


XPoP 


[]{Ay/x}> ATO[]0 


[ ]{Ay/x} 


CP , M value 


[hX, ATO[]0 




CP, M not value 


[]0, AyCi[]0 


[]0 



where 

A G {K, S, K'Zi, S'Zi, S"ZiZ2 \ Zi,Z2 fresh} 
C € {K,S,K',S',S"} 

Ci[ ] ranges over C[ ]::=[] | C[ ]Z \ ZC[ ] 

Figure 5: Second-order IPO contexts for the lazy CL*. 



Example: Let M = XMi. Some of the IPO reductions of M are the following: 
XMi ^ ''^""^ S"yMi; XMi ' S"Miy; XMi ' ^^^'^"''^ {YMi){ZMi); 



l{s"y/x}^ 



]{s"/x}^^ 



(yZ)(MiZ); XMi ' — (MiZ)(yZ); 
Notice that [ ]{Kyi...y„/x} is an IPO context for any n. 



XMi 



]{Ky/x} 



K'yMi; 



In general, the IPO contexts are summarized in Figure [5l 

Using Proposition [331 we can prove that the weak IPO bisimilarity —fj is a congruence, 
and it has a simpler characterization in terms of applicative contexts. Namely, we can 
consider as list extension category the category of all function-linear term contexts. In the 
alternative notation, a second-order linear term contexts can be written as C[_q^, . . . , 
where C[_i, ...,_„] is a first-order multi-holed context and 9i, . . . ,9n are n substitutions, 
each one acting on the term put in the corresponding hole. By repeating the arguments for 
the first-order case, one can show that any second-order linear term context either is IPO 
uniform or it has a reactive index. Then, by Proposition 13.5^ we have: 

Proposition 6.6. 

(i) For all terms of CL* M,N, for any substitution 9 and for any (possibly open) first- 
order context C[], 



M 

2* . 



,2* 



N 



.u - ^ C\M9\ ^tf C[N9\ . 

(ii) = where denotes the weak IPO bisimilarity, where only reactive IPO 

contexts are considered (see FigureW^. 



By Proposition I6.6( ii) above, the notion of IPO bisimilarity turns out to be much simpler, 
but it is still infinitely branching (when the term is of the shape XPqP we have infinitely 
many IPO contexts [ However, one can prove that also the contexts [ Ij^y/x}' 

for any [y[ > 1 can be eliminated. This requires an "ad-hoc" reasoning: 
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term M 


IPO contexts 


X 


[ ]iA/X}Y 


XPoP 




CP , M value 




CP , M not value 


[h 



where 

A G {K,S,K'Zi,S'Zi,S"ZiZ2 I Zi,Z2 fresh} 
C G {K,S,K',S',S"} 

Figure 6: Finitely branching second-order IPO contexts for the lazy CL*. 

Proposition 6.7. The lazy weak IPO bisimilarity has a finitely branching characteri- 
zation in terms of the second-order IPO contexts of Figure\^ 

Proof, (sketch) Let be the reduced bisimilarity obtained from by not considering 
the contexts [ Ij^y/x}' I I — Then C In order to show the converse, 

one can first prove that the following is a weak IPO bisimulation: R = {(M', N') \ 36. (M' 
Me A N' NO A M N}, where M ^ N means that M and N are ^^-convertible. □ 

Finally, we are left to prove that the second-order weak IPO bisimilarity exactly recover 
the lazy observational equivalence. More in general, we will prove that the two equivalences 
coincide on open terms. Namely, we can view open terms with n free variables as arrows 
from e to (n) (by identifying variables with metavariables). Thus we have directly a notion 
of equivalence on open terms. We will show that this equivalence coincides with the usual 
extension to open terms of the observational equivalence by substitution. This gives a 
uniform finitely branching characterization of the observational equivalence on all (closed 
and open) terms. 

Proposition 6.8. For all M, iV G A, M^iN T{M) T{N). 



Proof of Proposition 16.81 We will show that coincides with the natural extension 
to open terms of the first-order IPO bisimilarity 2:^^*^ of Section 15.21 

Definition 6.9. Let be the extension of to open terms of CL* defined by, for all 
M, N CL*-terms such that FV{M),FV{N) C {Xi, . . . , X„}, 

M^^jN iff V0 : {Xi, . . . , Xn} {CL*f. MO NO . 
Lemma 6.10. C 

Proof. We show that 7^ = {{MO, NO) \ M -f* jv a MO^Nd G {CL*f} is a first-order 
bisimulation. From M N , by Proposition l6.6l we have MO NO. Assume MO^iM', 
since MO c^J* NO, then N9^iN', M' ~|*^ N' and {M',N') G 7^. Now assume MO'^^'iM' , 
then M9^-XiM" with M"[P/X] = M', since MO -f; N9 then also NOU-'^N" with M" 
N". Thus NOWiN' and N"\P/X] = N' is closed. Thus {M',N') G 7^. □ 
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Lemma 6.11. Let M G CL*, M M' . Then M^^M' . 

Proof. The proof follows from the fact that \/9. MO M'9 and is closed under □ 
Lemma 6.12. C 

Proof. We show that 7^ = {(M, A^) [ M~^^A^} is a second-order bisimulation. IfM — >/ M', 
then there are two cases. 

(i) M = CM, for a combinator C on CL*. Then 9 = fl), and for any closing 9 and closed P 

such that \X\ = |P|, M9 M" and M" = M'9[P/X]. Since M9 N9, then N9 4i N" 
and M" ~^*^ A^". There are two subcases: either X = [] oi X = X. In the first subcase, we 
have M — >/ M' (second-order) and N ^ N (second-order), thus by Lemma [6TT]M'~^^A^, 
and hence {M',N) € TZ. In the second subcase, i.e., X = X, M is a value different from a 
variable, then one can check that also N must reduce to a value different from a variable, 

thus N^'^^ N' and A^" = N'9[P/X]. Thus M'Sl*i^N\ and hence {M',N') G 7^. 

(ii) M = XM. Since for any closing 9, M9 N9, then also N N' . Moreover, for 

any 9 closing M9,N9, for any P such that |P| = \X\, we have M6l^ -^7 M" , N99 A^", 
M" = M'9[P/X], N" = N'9[P/X\. Thus for all t . M't A^^', hence {M',N') £R. □ 

6.1.2. The Chv Second- order Reactive System. The main difference between the cbv and the 
lazy case is that the variables in the cbv case are meant to represent values, consequently 
cbv substitutions have to map variables into values. 
First of all, the values on CL* are defined by: 

y ::= X I K I KV | S | S'F | S"VV . 

Definition 6.13 (Cbv Second-order Reactive System on CL*). The chv second-order reac- 
tive system C^* consists of: 

• the function-linear category whose objects are the lists with at most one element, and 
whose arrows e — > (n) are the terms of CL* with, at most, n (first order) metavariables, 
and whose arrows (m) — > (n) are the second-order contexts defined, briefly, by: 

C ::= F{Vi,...,Vm) \ MC \ CM 

where the values Vi, . . . , Vm and the term A^ are built using n variables. 

• the reactive contexts are defined by 

B ::= F{Vi,...,Vm) \BM \VB; 

• the reaction rules are 

KXi K'Xi K'XiX2 Xi 

SXi S'Xi S'XiX2 S"XiX2 S" XiX2X^ {XiX2){XiX^) . 
By Proposition 16.31 have: 

Corollary 6.14. The reactive system C^* has redex RPOs. 
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term M 


IPO contexts 


reactive IPO contexts 


X 


[ ]{A/X}^: A[ ]0, RCl[ ]0, 




M a value but not a variable 


[]0X, A[]0, i?Ci[]0 


A[]0 


M reducible 




[]0 


M contains a critical variable 


[ ]{A/Cr(M)}-, RCl[ ]% 


[ ]\A/Cr{M)} 



where 

A G {K, S, K'Xi, S'Xi, S"XiX2 I Xi, ^2 fresh} 
R ranges over ::= AZ \ XR \ RT 
Ci[ ] ranges over C[] ::= [] \ C[ ]T \ TC[ ] 
with T ranging over T ::= X \ (TT) 



Figure 7: Second-order IPO contexts for cbv CL*. 

As in the lazy second-order context C : {m) — > (n) will be more conveniently 

denoted by C[ ]o, where C[ ] is a unary first-order context and 6* is a cbv substitution, i.e., 
s.t. 9{Xi) is a value, for alH = 1, . . . , m. 

According to our definition, there are terms that are neither values nor they are re- 
ducible (they do not contain any redex), the term XY is an example. A term M of this 
kind can be transformed in a reducible one by substituting a single specific variable with a 
value. We call critical variable a variable of this kind. 

Definition 6.15. The critical variable of a second-order term M, Cr(M), if it exists, is 
recursively defined by: 

Cr{V) = , 

Cr{XV) = X , 

Cr{VM) = Cr{M) , if M is not a value, 
Cr[MN) = Cr{M) , if M is not a value. 

The second-order IPO contexts for cbv are summarized in Figure [71 In that figure, the 
symbol R ranges over most general reducible terms. That is, any reducible term can be 
obtained by instantiating the variables of a term contained in that grammar. The symbol 
T is used to represent general terms; remember that variables represent general values. 

As for the previous case, by Proposition l3.5l and by considering as list extension category 
the category of all by-value function-linear term contexts, we have: 

Proposition 6.16. 

(i) For all terms of CL* M,N, for any substitution 9 and for any (possibly open) first- 
order context C[ ], 

M c^l} N =^ C[Me] c[Ne] . 

(ii) = — where —fj^ denotes the weak IPO bisimilarity, where only reactive IPO 
contexts are considered. 

It is important to notice that the reactive IPO contexts provide directly a finitely branching 
Its for the cbv combinatory logic (notice that, contrary to the lazy case, for the cbv case 
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IPO contexts of the shape [ ]|Ay/x}' -^^^ 1^1 — exist, since substitutions have to 

map variables into values). 

The cbv weak IPO bisimilarity turns out to be strictly included in the cbv contextual 
equivalence. Namely, if we consider 
T(Xx.x) = SKK , and 
T{\xy.xy) = S[S(KS)(S(KK)(SKK))][S(S(KS)(KK))(KK)] 
then T(Ax.x) T{Xxy.xy), however T{Xx.x) 9^^^ T{\xy.xy), because 

T{Xxy.xy) ^¥ S"(K'X)(S"KK) while T{Xx.x) X^X^ . 

The problem arises from the fact that in the second-order cbv bisimilarity we observe the 
existence of a critical variable, while in the contextual equivalence we do not. 



7. Final Remarks and Directions for Future Work 

There are several other attempts to deal with parametric rules in the literature. In his 
seminal paper |Sew02j . Sewell presents two different constructions, one based on ground 
reaction rules and the other based on parametric rules. The RPO construction can be seen 
as a categorical account of the ground rules construction. Parametric rules, in the form they 
are defined in |Sew02j . do not have an obvious categorical presentation. In |KSS05| . the 
authors introduce the notion of luxes to generalize the RPO approach to cases where the 
rewriting rules are given by pairs of arrows having a domain different from 0. Luxes can be 
seen as a categorical account of the parametric rules approach of Sewell. When instantiated 
to the category of contexts, the luxes approach allows to express rewriting rules not formed 
by pairs of ground terms but, instead formed by pairs of contexts (open terms), and so 
allowing parametricity. Compared to our approach, based on the notion of second-order 
context, the approach of luxes is more abstract and it can be applied to a wider range of 
cases (categories). However, if we compare the two approaches in the particular case of 
context categories, we find that the luxes approach has a more restricted way to instantiate 
a given parametric rule. This restriction results in a not completely satisfactory treatment 
of the A-calculus. It remains the open question of substituting the notion of second-order 
context with a more abstract and general one. This will allow to recover the extra generality 
of luxes. 

A possible alternative approach for dealing with the A-calculus in Leifer-Milner's RPO 
setting, it that of using suitable encodings in the (bi)graph framework |Mil06j . However, 
we feel that our term solution based on second-order context categories and CL is simpler 
and more direct. Alternatively, in place of CL, one could also consider a A-calculus with 
explicit substitutions, in order to obtain a convenient encoding of the /3-rule, allowing for a 
representation as a second-order reactive system. This is an experiment to be done. Here 
we have chosen CL, since it is simpler; moreover, the correspondence between the standard 
A-calculus and the one with explicit substitutions deserves further study. 
We have considered lazy and cbv strategies, however also other strategies, e.g. head and 
normalizing could be dealt with, possibly at the price of some complications due to the 
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fact that such strategies are usually defined on open terms. It would be also interesting to 
explore non-deterministic strategies on A-calculus. 
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